Monocular 3D Object Reconstruction with GAN Inversion

نویسندگان

چکیده

Recovering a textured 3D mesh from monocular image is highly challenging, particularly for in-the-wild objects that lack ground truths. In this work, we present MeshInversion, novel framework to improve the reconstruction by exploiting generative prior of GAN pre-trained synthesis. Reconstruction achieved searching latent space in best resembles target accordance with single view observation. Since encapsulates rich semantics terms geometry and texture, within manifold thus naturally regularizes realness fidelity reconstruction. Importantly, such regularization directly applied space, providing crucial guidance parts are unobserved 2D space. Experiments on standard benchmarks show our obtains faithful reconstructions consistent texture across both observed parts. Moreover, it generalizes well meshes less commonly seen, as extended articulation deformable objects. Code released at https://github.com/junzhezhang/mesh-inversion .

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Monocular On-Line 3D Reconstruction

The use of visual sensing for action generation in unknown environments is an attractive option due to the great representation power of vision, but it is challenging for two reasons. The representations used in vision are often not well suitable for planning, thus requiring complex learning approaches. Furthermore, an active agent needs to make decisions on-line, without the delay of o -line p...

متن کامل

Deformable Surface 3D Reconstruction from Monocular Images

Being able to recover the shape of 3D deformable surfaces from a single video stream would make it possible to field reconstruction systems that run on widely available hardware without requiring specialized devices. However, because many different 3D shapes can have virtually the same projection, such monocular shape recovery is inherently ambiguous. In this survey, we will review the two main...

متن کامل

Monocular Object Detection Using 3D Geometric Primitives

Multiview object detection methods achieve robustness in adverse imaging conditions by exploiting projective consistency across views. In this paper, we present an algorithm that achieves performance comparable to multiview methods from a single camera by employing geometric primitives as proxies for the true 3D shape of objects, such as pedestrians or vehicles. Our key insight is that for a ca...

متن کامل

Monocular Multiview Object Tracking with 3D Aspect Parts

In this work, we focus on the problem of tracking objects under significant viewpoint variations, which poses a big challenge to traditional object tracking methods. We propose a novel method to track an object and estimate its continuous pose and part locations under severe viewpoint change. In order to handle the change in topological appearance introduced by viewpoint transformations, we rep...

متن کامل

Simultaneous Monocular 2D Segmentation, 3D Pose Recovery and 3D Reconstruction

We propose a novel framework for joint 2D segmentation and 3D pose and 3D shape recovery, for images coming from a single monocular source. In the past, integration of all three has proven difficult, largely because of the high degree of ambiguity in the 2D 3D mapping. Our solution is to learn nonlinear and probabilistic low dimensional latent spaces, using the Gaussian Process Latent Variable ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-19769-7_39